Fast Missing Value Imputation Using Ensemble of Soms
نویسندگان
چکیده
This report presents a methodology for missing value imputation. The methodology is based on an ensemble of Self-Organizing Maps (SOM), which is weighted using Nonnegative Least Squares algorithm. Instead of a need for lengthy validation procedure as when using single SOMs, the ensemble proceeds straight into final model building. Therefore, the methodology has very low computational time while retaining the accuracy. The performance is compared to other state-of-the-art methodologies using two real world databases from different fields.
منابع مشابه
Combination of SOMs for Fast Missing Value Imputation
This paper presents a methodology for missing value imputation. The methodology is based on a combination of Self-Organizing Maps (SOM), where combination is achieved by Nonnegative Least Squares algorithm. Instead of a need for validation as when using traditional SOMs, the combination proceeds straight into final model building. Therefore, the methodology has very low computational time. The ...
متن کاملLinear Combination of SOMs for Data Imputation: Application to Financial Problems Linear Combination of SOMs for Data Imputation: Application to Financial Problems
This paper presents a new methodology for missing value imputation in a database. The methodology combines the outputs of several Self-Organizing Maps in order to obtain an accurate filling for the missing values. The maps are combined using MultiResponse Sparse Regression and the Hannan-Quinn Information Criterion. The new combination methodology removes the need for any lengthy cross-validati...
متن کاملMissing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملSparse Linear Combination of SOMs for Data Imputation: Application to Financial Database
This paper presents a new methodology for missing value imputation in a database. The methodology combines the outputs of several Self-Organizing Maps in order to obtain an accurate filling for the missing values. The maps are combined using MultiResponse Sparse Regression and the Hannan-Quinn Information Criterion. The new combination methodology removes the need for any lengthy cross-validati...
متن کاملA classifier ensemble approach for the missing feature problem
OBJECTIVES Many classification problems must deal with data that contains missing values. In such cases data imputation is critical. This paper evaluates the performance of several statistical and machine learning imputation methods, including our novel multiple imputation ensemble approach, using different datasets. MATERIALS AND METHODS Several state-of-the-art approaches are compared using...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010